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ABSTRACT 

We use a large sample of galaxy groups selected from the SDSS data release 4 with an adaptive 
halo-based group finder to probe how the clustering strength of groups depends on their masses and 
colors. In particular, we determine the relative biases of groups of different masses, as well as that 
of groups with the same mass but with different colors (either that of the central galaxy, or the total 
color of all group members). In agreement with previous studies, we find that more massive groups 
are more strongly clustered, and the inferred mass dependence of the halo bias is in good agreement 
with predictions for the ACDM concordance cosmology. Regarding the color dependence, we find 
that groups with red centrals are more strongly clustered than groups of the same mass but with blue 
centrals. Similar results are obtained when the color of a group is defined to be the total color of 
its member galaxies. The color dependence is more prominent in less massive groups and becomes 
insignificant in groups with masses <^ 10^^ /i~^M0. These results are consistent with those obtained 
by Yang et al. from an analysis of the 2dFGRS, but inconsistent with those obtained by Berlind et 
al. , who also used an SDSS group catalogue. We construct a mock galaxy redshift survey from the 
large Millennium iV-body simulation that is populated with galaxies according to the semi-analytical 
model of Croton et al. Applying our group finder to this mock survey, and analyzing the mock data in 
exactly the same way as the true data, we are able to accurately recover the intrinsic mass and color 
dependencies of the halo bias in the model. Interestingly, the semi-analytical model reveals the same 
color dependence of the halo bias as we find in our group catalogue. In halos with M ~ 10^^ /i~^M0, 
though, the strength of the color dependence is much stronger in the model than in the data. We 
discuss these results in light of the assembly bias of dark matter halos and the star formation histories 
of galaxies. 

Subject headings: dark matter - large-scale structure of the universe - galaxies: halos 



1. INTRODUCTION 

In the standard cold dark matter (CDM) paradigm of 
structure formation, virialized CDM halos are considered 
to be the building blocks of the mass distribution in the 
Universe. The properties of dark matter halos, as well as 
their formation histories and clustering properties, have 
been studied in great detail using both numerical simu- 
lations and analytical approaches such as the (extended) 
Press Schechter (1974) formalism. These studies have 
shown that halo bias is mass dependent, in that more 
massive halos are more strongly clustered (e.g.. Mo & 
White 1996; Sheth & Tormen 1999; Sheth, Mo & Tormen 
2001; Seljak & Warren 2004; Tinker et al. 2005). This 
mass dependence of the halo bias has played a crucial 
role in our understanding of the correlation function of 
both dark matter and galaxies, via the halo model (e.g., 
Cooray & Sheth 2002), the halo occupation model (e.g., 
Jing, Mo & Borner 1998; Seljak 2000; Peacock & Smith 
2000; Berlind & Weinberg 2002; Magliocchetti & Por- 
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ciani 2003; Berhnd et al. 2003; Zheng et al. 2005), and 
the conditional luminosity function (CLF) model (e.g., 
Yang, Mo & van den Bosch 2003; van den Bosch, Yang 
& Mo 2003; Vale & Ostriker 2006; Cooray 2006; van den 
Bosch et al. 2007). 

Observationally, the clustering strength of galaxy sys- 
tems (groups and clusters) is found to increase with their 
mean separation d = n^^^'^, with n the number den- 
sity of objects (e.g. Bahcall & West 1992; Zandivarez et 
al.' 2003; Padilla et al. 2004; Yang et al. 2005b). Assum- 
ing that richer groups are statistically associated with 
more massive halos, one can estimate the halo masses 
of galaxy groups using a theoretical halo mass function, 
such as the one given by the current ACDM model (see 
Yang et al. 2005b; Berhnd et al. 2006a; Yang et al. 2007). 
Thus the relation between mean separation and cluster- 
ing strength can be converted into a relation between 
halo mass and halo bias. As shown in Yang et al. (2005b), 
the mass dependence of the halo bias obtained this way 
from galaxy groups is in good agreement with the pre- 
diction of the current ACDM model. 

In addition to the mass dependence, recent simulations 
have shown that halo bias also depends on the assem- 
bly history. Using A^-body simulations, Sheth & Tormen 
(2004) studied the environmental dependence of halo as- 
sembly and found that dark matter halos in dense envi- 
ronments assemble at slightly earlier times than halos of 
the same mass in low density regions. Gao et al. (2005) 
used a very large, high-resolution numerical simulation of 
structure formation in a ACDM cosmology to reexamine 
the mass and age dependence of halo bias. They found 
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that, for halos of M < 10^^ H^^Mq at redshift z = 0, the 
bias factor depends not only on halo mass but also on the 
halo assembly time. This result has been confirmed by a 
number of subsequent investigations (Harker et al. 2006; 
Wechsler et al. 2006; Wetzel et al. 2006; Gao & White 
2007; Jing, Suto & Mo 2007). The origin of this assembly 
bias has recently been investigated in a number of papers 
(Wang, Mo & Jing 2007; Keselman & Nusser 2007; Hahn 
et al. 2007). 

If the properties of galaxies that reside in haloes of a 
given mass correlate with the assembly history of that 
halo, the assembly bias will impact the galaxy bias, 6gai, 
to the extent that 6gai not only depends on halo mass, 
as generally assumed in HOD and CLF modeling, but 
also on the formation history of that halo. Using a 
very crude method for assigning galaxy luminosities to 
dark matter haloes. Reed et al. (2007) conclude, how- 
ever, that the luminosity- weighted galaxy ages cannot 
closely trace the assembly epoch of their dark matter 
hosts, otherwise the observed color-dependence of the 
clustering can not be reproduced. Using a galaxy for- 
mation model grafted on to the Millennium Simulation, 
Croton, Gao & White (2007) compared the original sim- 
ulation to 'shuffled' versions in which the galaxy popu- 
lations are randomly swapped among haloes of similar 
mass. They found that the shuffled versions have corre- 
lation functions that are different from the original ones 
at the 5 to 10 percent level, indicating that galaxy prop- 
erties are (weakly) correlated with the halo assembly his- 
tory. Unfortunately, since their semi-analytical model is 
not a perfect representation of the true universe (see e.g., 
Weinmann et al. 2006b and Baldry et al. 2006), it is un- 
clear whether such a correlation also exists in reality. 

To test this, various authors have examined whether 
there is any sign of assembly bias in the large scale struc- 
ture of the observed galaxy distribution. Using a galaxy 
group catalogue constructed from the 2-degree Field 
Galaxy Redshift Survey (2dFGRS; CoUess et al. 2001), 
Yang et al. (2006) examined how the clustering strength 
of galaxy groups depends on the star formation activity 
of the central galaxies. At fixed mass they found that 
the bias of galaxy groups decreases as the SFR of the 
central galaxy increases. Assuming that galaxies with a 
more active SFR are bluer, this implies that groups with 
a red central are more strongly clustered than groups of 
the same mass but with a blue central. Surprisingly, 
a completely opposite trend was found by Berlind et 
al. (2006a) using the group catalogue constructed by 
Berlind et al. (2006b) from the Sloan Digital Sky Sur- 
vey (SDSS; York et al. 2000). They found that massive 
groups with bluer centrals are more strongly biased on 
large scales than groups of the same mass but with red- 
der centrals. Tinker et al. (2007) used the (projected) 
galaxy correlation function and galaxy void statistics to 
test whether the galaxy content of halos of fixed mass 
is systematically different in low density environments. 
Using data from the SDSS combined with HOD models, 
they find that the luminosity and color of field galaxies 
are determined predominantly by the mass of the halo 
in which they reside and have little direct dependence on 
the environment in which the host halo formed. Since, as 
shown by Sheth & Tormen (2004), the assembly history 
of a halo is correlated with its environment, this therefore 
implies that there is no significant correlation between 



the color of a galaxy and the assembly history of its dark 
matter halo. Finally, Blanton & Berlind (2006)compared 
the actual relationship between galaxy colors and large 
scale density field to that predicted by the null hypothe- 
sis that galaxy colors are only dependent on the mass of 
the halo in which they reside. To this extent, they use 
a similar shuffling technique as Croton et al. (2007) , but 
using galaxy groups in the SDSS rather than dark mat- 
ter halos in a semi-analytical model for galaxy formation. 
They find that shuffled and unshuffled color-density re- 
lations agree to better than 5 percent, indicating that 
the large scale environment of a group has only a very 
mild impact on the properties of its member galaxies. 
Clearly, the current status as to whether galaxy proper- 
ties are correlated with the halo assembly history is still 
very confusing. 

In this paper, we re-examine the dependence of the 
halo bias on group mass and on the colors of mem- 
ber galaxies, using the galaxy group catalogue recently 
constructed by Yang et al. (2007) from the SDSS 
DR4 (Adelman-McCarthy et al. 2006). An important 
improvement of this catalogue over that of Yang et 
al. (2006) and Weinmann et al. (2006a) is that group 
masses are estimated both from the total luminosity and 
the total stellar mass of member galaxies, allowing us to 
examine how the results may be affected by the uncer- 
tainties in the estimate of group masses. Furthermore, in 
order to test the impact of the uncertainties in the group 
finder and in the methods used to assign masses to our 
groups, we use a mock galaxy redshift survey (hereafter 
MGRS) constructed from the Millennium semi-analytical 
galaxy sample (hereafter SAM) of Croton et al. (2006) 
to test how accurately we can recover the true trends in 
the semi- analytical model. 

This paper is organized as follows. Section [2] describes 
the galaxy sample and group catalogue used for our anal- 
ysis, while Section [3] presents our method for estimating 
the group-galaxy cross-correlation function. Our obser- 
vational results for the dependence of the group-galaxy 
cross-correlation on group mass and on the color of the 
central galaxy are presented in Section [H The same re- 
sults, but for the semi-analytical MGRS are presented 
in Section [5l Finally, in Section \6\ we summarize and 
discuss our findings. Throughout this paper, we adopt 
the ACDM 'concordance' cosmology with flm = 0.24, 
r^A = 0.76, h ~ 0.73 and as = 0.77. Distances are 
quoted in units of /i^^Mpc. 

2. GALAXY AND GROUP SAMPLES USED 

2.1. The Observational Samples of Galaxies 

We use galaxy samples constructed from the New 
York University Value-Added Galaxy Catalogue (NYU- 
VAGC, see Blanton et al. 2005b). The version of the 
catalogue used here is based on the SDSS Data Re- 
lease 4 (Adelman-McCarthy et al. 2006). Only galax- 
ies with redshifts 0.01 < z < 0.2, extinction-corrected 
Petrosian magnitudes 14.5 < r < 17.6, and redshift 
completeness C > 0.7 are selected. The r-band and g- 
band absolute magnitudes are corrected to z = 0.1 using 
the X-correction code of Blanton et al. (2003a) and the 
luminosity-evolution model of Blanton et al. (2003b). In 
the apparent-magnitude limits adopted here, the bright 
end is chosen to avoid the incompleteness due to the large 



The clustering of groups 3 



TABLE 1 

Volume-limited Samples of Galaxies 



Sample 


z 


° iMr - Slog 


h 


Ngalaxy (obs.) 


Ngalaxy (mOck) 


VI 


(0.064, 0.127) 


(-22.0, -20.5 




54116 


66484 


V2 


(0.064, 0.157) 


(-22.0, -21.0 




39101 


51627 



Note. — Column 1 indicates the volume-limited galaxy sample ID. Column 2 gives the redshift range of each sample. Column 3 is 
the absolute-magnitude range. Columns 4 and 5 list the galaxy number in each sample for observational and mock SDSS catalogues, 
respectively. 



TABLE 2 
Galaxy Groups in the Observation 



ID 


logA/h 


z 


N group 




CO (Ml/Ms) 




CI {Ml/Ms) 








(Ml/Ms) 


Red 


Green 


Blue 


Hod 


Blue 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


Gl 


(12.0, 12.5] 


(0.064,0.127) 


43233/41314 


0.99/0.99 


0.87/0.88 


0.60/0.66 


0.94/0.96 


0.65/0.77 


G2 


(12.5, 13.0] 


(0.064,0.127) 


15614/15450 


1.01/1.01 


0.91/0.92 


0.67/0.72 


0.96/0.99 


0.73/0.84 


G3 


(13.0, 13.5] 


(0.064,0.157) 


9909/10104 


1.03/1.03 


0.94/0.95 


0.71/0.75 


0.99/1.01 


0.80/0.88 


G4 


(13.5, 14.0] 


(0.064,0.157) 


2976/2826 


1.04/1.04 


0.96/0.97 


0.73/0.75 


0.99/1.01 


0.86/0.89 



Note. — Column 1 indicates the group sample ID. Column 2 indicates the mass range of each group sample in terms of log[J\/;i/ /i~^Mq]. 
Column 3 gives the redshift range of each sample. Column 4 lists the group number in the corresponding redshift and mass ranges, where Ml and 
Ms mean group masses estimated from the ranking of group luminosity -L19.5 and stellar mass MsteUar; respectively (Yang et al. 2007). Columns 
5-7 list the means of the central galaxy colors (CO) for groups with red, green (galaxies between red and blue Gaussian peaks), blue central galaxies, 
respectively. Columns 8 and 9 list the means of the total group colors (Cl) for groups which are separated into red and blue subsamples in similar 
numbers. See the text for more details. 



TABLE 3 

Galaxy Groups in the MGRS 



ID 


log A4 


z 


^ group 




CO (Ml/Ms) 




Cl (Ml/Ms) 








{Ml/ Ms) 


Red 


Green 


Blue 


Red 


Blue 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


SI 


(12.0, 12.5 


(0.064,0.127) 


46274/44761 


0.98/0.99 


0.86/0.86 


0.53/0.58 


0.89/0.90 


0.54/0.61 


S2 


(12.5, 13.0 


(0.064,0.127) 


16929/16538 


0.98/0.98 


0.87/0.89 


0.57/0.61 


0.90/0.93 


0.59/0.68 


S3 


(13.0, 13.5 


(0.064,0.157) 


11139/11231 


0.97/0.97 


0.89/0.91 


0.55/0.60 


0.92/0.94 


0.61/0.69 


S4 


(13.5, 14.0 


(0.064,0.157) 


3311/3453 


0.97/0.97 


0.92/0.93 


0.47/0.63 


0.93/0.94 


0.53/0.68 



Note. — Similar to Tablc[2] but for galaxy groups in the semi-analytical Mock Galaxy Redshift Survey (MGRS). 



angular sizes of nearby galaxies, while the faint end is 
chosen to match the magnitude limit of the SDSS main 
galaxy sample. In addition, we also include galaxies with 
0.01 < z < 0.2 in the NYU-VAGC that have redshifts 
from alternative sources: 2dFGRS (CoUess et al. 2001), 
the PSCz (Saunders et al. 2000) or RC3 (de Vaucouleurs 
et al. 1991). Our final sample consists of 286563 galaxies. 
In our analysis, we construct two volume-limited samples 
in two bins of the absolute magnitude '^■^Mr — 5 log h: VI 
(-22.0, -20.5] and V2 (-22.0, -21.0] (see Table □ for the 
detailed selection criteria). Note that these two volume- 
limited galaxy samples are different from those used in 
constructing the group catalogues, and are used only in 
measuring the cross correlations. 

2.2. Mock Samples of Galaxies 

In order to check how accurately our methodology al- 
lows us to detect true effects of assembly bias, we con- 
struct a mock galaxy redshift survey (IMGRS) that in- 
corporates the same observational selection effects as the 
SDSS sample. Our IVIGRS is based the model galaxy 



catalogue constructed by Croton et al. (2006) using a 
semi-analytic model of galaxy formation in combination 
with the 'Millennium Run' iV-body simulation (Springel 
et al. 2005) ^. The cosmological parameters adopted 
in the 'Millennium Run' are flm — 0.25, fl\ = 0.75, 
and a CDM spectrum with an amplitude specified by 
(jg = 0.9. The simulation was performed with the code 
GADGET2 (Springel et al. 2005), using 2160^ dark mat- 
ter particles in a periodic cubic box with a side length 
ibox — 500 /i~^Mpc (in comoving units). The mass of 
a particle is 8.6 x 10® h~^MQ. The galaxy catalogue is 
generated based on a semi-analytical model of galaxy for- 
mation which uses the simulated halo merging trees. We 
refer the reader to Croton et al. (2006) for the details 
of the semi-analytic model. Because of the finite mass 
resolution in the simulation, the sample is complete to a 
luminosity limit ^-^M^ — 51og/i ~ —16.6. 

Our construction of the MGRS here is similar to that 
described in Li et al. (2007a; see also Yang et al. 2004). 

I http : / / www. mpa-garching. mpg. de /galfor m/ agnpaper | 
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Fig. 1. — The color-magnitude distributions of galaxies in the SDSS (left-hand panel) and the MGRS (right-hand panel). The dotted- 
and dashed- lines indicate the centers of the two Gaussian used to describe the color distributions at a given luminosity bin (see text for 
details). These are used to split the galaxy populations in red (above the red sequence), blue (below the blue sequence) and green (in 
between the red and blue sequences). For clarity, only 10% of all galaxies in the SDSS and MGRS are shown. 



First, we stack 3x3x3 replicates of the simulation box 
and place a virtual observer at the center of the stacked 
boxes. Next, we assign each galaxy a (a, (5)-coordinate 
and remove the ones that are outside the mocked SDSS 
survey region. For each model galaxy in the survey re- 
gion, we compute its redshift (which include the general 
expansion, the peculiar velocity, and a 35 kms~^ Gaus- 
sian line-of-sight velocity dispersion to mimic the red- 
shift errors in the data), its r-band apparent magnitude 
(based on the r-band luminosity of the galaxy), and its 
absolute magnitude °-^Mj. — 5 log/i which is K + E cor- 
rected to z — 0.1. We eliminate galaxies that are fainter 
than the SDSS apparent-magnitude limit, and incorpo- 
rate the position-dependent incompleteness by randomly 
eliminating galaxies according to the completeness fac- 
tors obtained from the survey masks that are provided 
by the NYU-VAGC (Blanton et al. 2005b). Finally we 
select the same two volume-limited samples (VI and V2) 
from the MGRS as we used for the actual data. The num- 
bers of (mock) galaxies in these two samples are listed 
in the last column of Table [T] Note that they do not 
match the SDSS data perfectly. This mainly owes to the 
fact that the semi-analytical model does not match the 
observed luminosity function perfectly. 

2.3. Random Samples of Galaxies 

In order to measure the two-point correlation func- 
tions, one needs to construct random samples to nor- 
malize the pair counts. To do this, we first generate a 
set of random 'galaxies' according to the luminosity func- 
tion obtained by Blanton et al. (2003b). Here a large set 
of points are randomly distributed within the survey sky 
coverage and each point is assigned a redshift and an ab- 
solute magnitude according to the luminosity function. 
The redshifts are assigned to random galaxies by assum- 
ing that they have a constant number density as a func- 
tion of redshift. Similar to the last step in constructing 
the MGRS, we apply the completeness and magnitude 



limit according to the survey masks provided by Blanton 
et al. (2005b). The total number of 'galaxies' in each 
random sample is about 7.5 times as large as that in the 
corresponding observational or mock sample. 

2.4. Galaxy Group Samples 

Our analysis is based on the group catalogue of Yang 
et al. (2007), which is constructed from the SDSS Data 
Release 4 using a modified version of an adaptive halo- 
based group finder (Yang et al. 2005a). This group finder 
is optimized to assign galaxies into groups according to 
their common dark matter halos. Each group is assigned 
two values of halo mass, one is based on the ranking of 
group characteristic luminosity and is referred to as the 
luminosity-based halo mass (Ml), and the other is based 
on the total stellar mass of the group and is referred 
to as the stellar mass-based halo mass (Ms) (see Yang 
et al. 2007 for details). Note that this method requires 
knowledge of the halo mass function, and is therefore 
cosmology dependent. 

In this paper, we measure the galaxy-group cross- 
correlation function (hereafter GGCCF) to quantify the 
dependence of the clustering on group mass and on the 
color of the group members. To this end, we divide 
galaxy groups into four samples according to their halo 
masses. The criteria used to define these samples are 
listed in Table [H We further divide each of the group 
samples into color subsamples using either the color of 
the central galaxy (CO) or using the total color of all 
group members (CI). Below we describe these color sub- 
samples in more detail. 

• CO subsamples: based on the color of the central 
galaxy. Here, the central galaxy of a group is de- 
fined to be the brightest member in the group, and 
we assume that the location of the central galaxy 
coincides with the center of mass of the group. 
Tests show that our results do not change signif- 
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icantly if the central galaxy is defined to be the 
most massive group member instead. Note that 
only a very small portion ( < 0.6%) of groups in 
which the most massive galaxy is different from the 
brightest galaxy. Recent investigations of the color- 
magnitude distribution of galaxies show that the 
colors of galaxies for a given absolute magnitude 
follows a bi-normal form (e.g., Baldry et al. 2004; 
Blanton et al. 2005a; Li et al. 2006). Following Li 
et al. (2006), we separate the SDSS galaxies into 
118 bins according to their absolute magnitudes, 
°-^Mj. — 5\ogh, and fit the "'^(g — r) color distri- 
bution in each bin using a double- Gaussian func- 
tion. The centers of the Gaussians are shown in 
the left-hand panel of Fig. [T] as solid and dashed 
lines, and are used the split the galaxy population 
in red (above solid line) blue (below dashed line) 
and green (in between the solid and dashed lines) 
subsamples. Using the color of the central galaxies, 
we also split the group catalogue in red, blue and 
green subsamples. The mean colors of the central 
galaxies in each subsample are listed in columns 4 
- 6 of Table [3 

• CI subsamples: based on the total color of each 
group. In this case, we first add up the r-band and 
g-band luminosities of all member galaxies for each 
group to get the total group luminosity Lj-^t, Lg t, 
respectively. The groups are then separated into 
red and blue subsamples of equal size (i.e., with 
equal numbers of groups) according to the group 
color 2.5 * (logLr,* — logig,t). The mean colors of 
the groups in each subsample defined in this way 
are listed in columns 7 and 8 of Table [2] 

We have also selected galaxy groups from the MGRS 
using exactly the same group-finding algorithm as ap- 
plied to the real sample (see Yang et al. 2007 for more 
details). Here again we assign to each selected group 
two values of mass, one (Ml) based on the characteristic 
group luminosity and the other (Ms) based on the group 
stellar mass. The groups selected from the MGRS are 
also divided into samples of different masses and into sub- 
samples according to the color of the central galaxy and 
according to the total group color, respectively. The in- 
formation about about these samples can be found in Ta- 
ble[3]with the same format as in Table[21 When modeling 
the color-magnitude distribution of the mock galaxies, we 
separate the galaxies into 113 bins of "-^M^ — 5 log h and 
fit the '^'^{g — r) color distribution in each bin with a 
double-Gaussian function. The right-hand panel of Fig[T] 
shows the two ridges defined by the peaks of the double- 
Gaussian function. As for the SDSS data, these ridges 
again are used to separate galaxies into red, green and 
blue subsamples. 

3. THE GROUP-GALAXY CROSS-CORRELATION 
FUNCTION 

In this paper, we use the group-galaxy cross correlation 
function (hereafter GGCCF) to study the clustering of 
galaxy groups in the galaxy density field. We choose to 
use the cross correlation instead of the auto-correlation, 
because in this case much larger number of galaxies (com- 
pared to the number of groups) can be used as tracers 



of the underlying density field, allowing a more accurate 
determination of the correlation strength of the groups. 
We estimate the GGCCF in redshift space using the def- 
inition proposed in Davis & Peebles (1983), 



_ CGjs) nji _ 



(1) 



where CG is the count of group-galaxy pairs between a 
group sample and the corresponding galaxy sample; CR 
is the count of group- 'random' pairs between a group 
sample and the corresponding random 'galaxy' samples; 
s is the separation of the pairs in redshift space; no and 
rifl are the number densities of the galaxy sample and the 
random sample, respectively. Group-galaxy and group- 
random pairs are counted in logarithmic bins in s, with 
bin width Alogio(s) = 0.14. Note that the redshift- 
space correlation function is not linearly proportional to 
the bias of the groups. Indeed, the large-scale redshift- 
space mono-pole of the auto correlation function is 

m = * fiP) = * (1 + 2/3/3 + (3^5) (2) 

where 13 — Q^jf/b. So the amplitude of ^(s) increases less 
rapidly with halo mass than ^(r). Assuming fi™ = 0.25, 
the difference in /(/?) between b — 1 and 6 = 2 is 1.33 
to 1.15, respectively. Thus, we caution that the relative 
bias measured from the ^(s) may be underestimated by 
a few percent ( <, 1 — •\/1.15/1.33). 
We also estimate the projected GGCCF, defined as 



W{rp) = 2 1 £,{rp, r^)dr^ = 2 ^(r^, r^^k)^r^,k 
Jo 



k 



(3) 

where C(?'p,r,r) is the GGCCF defined in a similar way 
to Eq. ^ , but as a function of the separations of group- 
galaxy pairs perpendicular (rp) and parallel (r^r) to 
the line-of-sight. Here, group-galaxy and group-random 
pairs are counted in logarithmic bins in Vp, with a bin 
width A logiQ^Vp) = 0.17, and in linear bins of r^, with a 
bin width Ar^ = 1 /i^^Mpc. In practice, the summation 
in ([3]) is carried over k from 1 to 40, corresponding to 
< < 40 h-^Mpc. 

Since we have two galaxy samples VI and V2, we can 
make different combinations of the group and galaxy 
samples so that the depths of the samples to be cross 
correlated match the best. Unless stated otherwise, we 
will use VI to cross-correlate with the group samples Gl 
and G2 (or SI and S2), while we use V2 to cross-correlate 
with the group samples G3 and G4 (or S3 and S4). 

We estimate the errors of the GGCCF using the boot- 
strap resampling method (Barrow, Bhavsar, & Sonoda 
1984; Mo, Jing & Borner 1992). We compute the GGC- 
CFs for 100 bootstrap samples of both galaxies and 
groups (either real or mock), and estimate the errors 
from the scatter among these GGCCFs. For this pur- 
pose, we measure the covariance matrix of the GGCCF, 
which is then diagonalized. Throughout the paper we 
use the diagonalized terms of the covariance matrix to 
plot the error bars. 

4. RESULTS FROM THE OBSERVATIONAL 
SAMPLES 

4.1. The Dependence of the Group-Galaxy 
Gross-Gorrelation on Group Mass 
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Fig. 2. — Upper panels: the projected GGCCF between the volume-Umited galaxy sample V2 (-22.0, -21.0] and the group samples 
(G1-G4) within different mass ranges as indicated. Lower panels: similar to the upper panels, but for GGCCF measured in redshift space. 
In the plot, AIi^ (left panels) and Ms (right panels) correspond to groups with masses that are estimated from the ranking of group 
luminosity and stellar mass, respectively. 



We start by examining how the group-galaxy cross- 
correlation depends on group mass. To this extent, 
we measure the projected and redshift-space GGCCFs 
between group samples G1-G4 and the volume-limited 
galaxy sample V2. The corresponding results are shown 
in Fig. [21 Upper and lower panels correspond to the 
projected GGCCFs and redshift-space GGCCFs, respec- 
tively. Results are shown for both group mass estimates. 
Ml and Mg, separately in the left and right-hand pan- 
els, respectively. Clearly, the GGCCFs for more massive 
groups have higher amplitudes, although their shapes are 
similar on large scales. Since the same galaxy sample is 
used for all these GGCCFs, this implies that more mas- 
sive groups are more strongly correlated in the galaxy 
density field. Note also that the results for Ml and 



Ms are very similar, indicating that they do not depend 
strongly in which mass indicator we use. 

To quantify the mass-dependence of the cross correla- 
tion amplitude, we measure the relative clustering bias 
for groups of different masses. We obtain the relative 
bias using the average ratio of the GGCCFs in the range 
4.19/i-iMpc < Tp < 19.17 /i-^Mpc or AM h'^Mpc < 
s < 22.98 /i~^Mpc between group samples Gi (i=l,4) and 
Gl. In Fig. m we show the relative bias measured from 
the projected (left panel) and the redshift-space (right 
panel) GGCCFs, as function of halo mass. The increase 
of the bias with group mass can be well modeled with 
a quadratic function, which is also shown in the figure. 
We compare the observed bias - mass relation with the 
prediction of the halo bias model of Sheth, Mo & Tor- 
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Fig. 3. — Left panel: The average relative bias parameters, obtained from the projected GGCCFs in the range 4.19 /i-^Mpc < rp < 
19.17 h~^Mpc shown in the upper panels of Fig. [2] as a function of group mass (triangles) and Mg (squares). The error bars are 
measured based on the l-cr variances of 100 bootstrap re-samplings. The relative bias parameters are normaUzed by the projected GGCCF 
of the group sample Gl. The solid curve shows the best fitting to the relative biases of groups with mass Af^ (triangles). While the 
dashed curve is the best fitting relative biases for groups with mass Mg (squares). Right panel: The same as the left panel, but for 
relative bias parameters obtained using the GGCCFs in redshift space, i.e. the data shown in the lower panels of Fig [3] in the range 
4.64 ?t~^Mpc < s < 22.98 /i~^Mpc. The corresponding best fitting results are plotted as the solid and dashed lines respectively. We also 
show the relative biases obtained from the halo model predictions (Sheth, Mo & Tormen 2001) for the cosmologies we adopted for the SDSS 
observation [Om,r2A,o'g] = [0.24,0.76,0.77] and the Millennium simulations [Sim, f^Ai ""s] = [0.25,0.75,0.9] respectively using long-dashed 
and dot-dashed lines. 



men (2001), adopting the same cosmology as used in the 
construction of the group catalogue (Yang et al. 2007) 
(the long-dashed line), and adopting the same cosmology 
as used in the 'Millennium Run' simulation (dot-dashed 
line). As one can see, the observed relation is well re- 
produced by the model, but it does not allow us to dis- 
criminate between these two cosmological models. These 
results are all in excellent agreement with those obtained 
by Yang et al. (2005b) based on the groups selected from 
the 2dFGRS. 

4.2. The Dependence on Group Color 

Next we investigate how the GGCCF depends on the 
colors of the galaxies in the groups. We first consider the 
case CO described above, in which the color of a group 
is defined by the color of its central galaxy. We measure 
the projected GGCCFs between the groups with all, red, 
green or blue centrals and the galaxy samples VI or V2. 
The relative bias for each case is then obtained from the 
ratio between the projected GGCCF of the case in ques- 
tion and that of the groups with all centrals. The result- 
ing relative bias, 6i-ei, is shown as a function of in Fig. 
m The four different panels correspond to four differ- 
ent group mass bins, Gl - G4, as indicated, where group 
masses are based on Ml. The relative bias relations for 
groups with red, green and blue centrals are plotted as 
dotted, long-dashed and dashed lines, respectively. In 
order to obtain better statistics, the projected GGCCFs 
are measured between the following pairs of group-galaxy 
samples: Gl-Vl; G2-V1; G3-V2 and G4-V2. Note that 
in general groups with red centrals are more strongly 
clustered than groups of the same mass but with blue 
centrals. The effect is somewhat stronger at smaller sep- 
arations and for less massive groups. Groups with green 
centrals show no significant bias relative to the total pop- 
ulation. 
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Fig. 4. — The relative biases fe^cl a function of rp obtained 
from the ratios of the projected GGCCFs of groups with red, green 
and blue central galaxies to that of all groups in different group 
mass bins as indicated. In this plot, the group masses, A/j^, are 
estimated using the ranking of the characteristic group luminosity. 



We can also measure the relative bias using the GGC- 
CFs in redshift space. Fig. [5] shows this relative bias, 
bfci, as a function of the redshift-space separation s. The 
overall behavior of the color dependence of the relative 
bias obtained here is very similar to that shown in Fig. HI 

Note that the results presented in Figs. [4] and [5] are 
based on the halo mass estimate M^. It is interest- 
ing to compare these with the results obtained using 
the mass estimate Ms- Since red groups have larger 
stellar masses than blue groups of the same luminosity, 
these two estimates of halo masses, Ml and Ms, may 
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Fig. 6. — Similar to Fig. U but for groups with masses, Mg, 
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be systematically different for red and blue groups of the 
same luminosity, which may affect the color dependence 
of the GGCCF discussed above. In Figs. [S] and [7] we 
show the relative bias of groups of different color prop- 
erties against the projected separation and redshift- 
space separation s for groups with similar Ms- The color 
dependence of h^-d is clearly significant for groups with 
Ms < IQ^^-^h^^M-Q, though it is weaker than that shown 
in Figs. H and El This can be explained as follows. Be- 
cause Ml is smaller than Ms for redder groups, for a 
given mass bin, the average value of Als of the redder 
subsample defined by Ml is larger than that defined by 
Ms- Consequently, the bias of the redder subsample is 
boosted in the Ml sample because of the increased halo 
masses. Although Ms is arguably a better estimate of 
the halo mass (see Yang et al. 2007), the difference in 
the color dependence of the bias due to the use of differ- 
ent halo mass estimates suggests that further tests are 



required to assess the reliability of our results. We will 
return to this in the next section. 

To quantify the dependence of the relative bias 6rci on 
the color of the central galaxy, we calculate the average 
relative bias factor for groups in each mass bin based on 
the projected GGCCF over the range 4.19 /i-^Mpc < 
Tp < 19.17 h~^Mpc, and the corresponding bias fac- 
tor based on the redshift-space GGCCF over the range 
4.64/i~iMpc <s< 22.98/i-iMpc. The results are shown 
in Fig [8] The upper two panels show the relative biases 
for groups with halo masses based on Ml, while the lower 
two panels are those for groups with halo masses based 
on Ms- The left two panels are results obtained from 
the projected GGCCFs, while the right panels are results 
obtained from the redshift-space GGCCFs. The related 
error bars are estimated from the 1-cr variances of the 100 
bootstrap re-samplings. Note that the average masses of 
groups are slightly different when the group samples Gl - 
G4 are divided into color subsamples. To take care of the 
contamination due to such mass difference, we have cor- 
rected the relative bias of the color subsamples relative 
to the whole sample, using the mean relation shown in 
Fig. [3] according to the average log Mh of the subsample 
in question. Such correction is applied throughout when 
we examine the color dependence of the relative bias. 

As one can see from Fig. [SJ at a fixed group mass, 
the relative bias of groups with red centrals is higher 
than that of groups with blue centrals. The color 
dependence is more significant for groups of lower 
masses, and becomes insignificant in massive groups with 
M > IQi^-s /j-iMq. As already eluded to above, the 
results obtained from the projected and redshift-space 
GGCCFs are very similar, while the color dependence is 
weaker when using Ms to estimate group masses than 
when using Ml- Note that as we will show in the 
next section part of the color dependence is induced 
by systematic error, which is stronger for group mass 
Ml- To illustrate to what extent the color-dependence 
of the group clustering can be compared with the age- 
dependence of halo clustering, we show, as the dot- 
dashed and dot-long-dashed lines, in the lower left panel 
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Fig. 8. — Panel (a): the average relative biases as a function of halo mass (for M^) obtained from the projected GGCCFs shown in 
Fig. |4]in the range 4.19 fe~^Mpc < rp < 19.17 /i~^Mpc. Panel (c): similar to panel (a), but for groups with halo masses Ms- Panel (b): 
the average relative biases as a function of halo mass (for M^) obtained from the redshift space GGCCFs shown in Fig. [5] in the range 
4.64 h~^Mpc < s < 22.98 h~^Mpc. Panel (d): similar to panel (b), but for groups with halo masses Mg. In each panel, we show the 
relative biases for groups with red, green and blue central galaxies using open triangles, squares and circles, respectively. For comparison 
we show also in the lower-left panel the age dependence of the halo clustering obtained by Gao et al. (2005) from N-body simulations. The 
dot-dashed and dot-long-dashcd lines correspond to relative biases for the 20% old and young halos with respective to all halo populations, 
respectively. 



of Fig. El the results for the 20% oldest and the 20% 
youngest halos obtained by Gao et al. (2005). The color- 
dependence of the group clustering is qualitatively simi- 
lar to the age-dependence of the halo clustering. Quan- 
titatively, however, while groups with blue centrals have 
relative bias similar to that of young halos, the bias for 
low-mass groups with red central galaxies is significantly 
lower than that of low-mass old halos. Thus, the color of 
the central galaxy and the assembly history of the host 
halo is correlated, but the relation contains a large ran- 
dom component. 

Following Berlind et al. (2006a), we also determine the 
dependence of the relative bias on the overall group color 
(using the CI subsamples defined in section Yl^ . The 
average relative biases of red and blue groups thus de- 



fined are plotted in Fig. [5] as a function of halo mass*. 
A comparison with Fig. [8] shows that the color depen- 
dence of the relative bias based on CI is weaker than 
that based on CO. The main reason for this is that in CO 
groups are separated into three color subsamples, while 
only two subsamples are used in CI. Consequently, the 
red and blue subsamples in CO are further separated in 
color-space than those in CI (cf. Tabled]). 

All these results regarding the color dependence of the 
halo bias are in good, qualitative agreement with those 
obtained by Yang et al. (2006) from the 2dFGRS, as 
long as red galaxies are mainly the galaxies with passive 
star formation. However, they are completely opposite 

* For completeness, we have also determine the relative biases 
using the average colors of the group members, rather than the 
total colors, which yields results that are virtually indistinguishable 
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Fig. 9. — Panel (a): The average relative bias, obtained using the projected GGCCFs in the range 4.19 ft-^Mpc <rp < 19.17 fe-iMpc 
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to the results obtained by Berlind et al. (2006a) who, us- 
ing the SDSS group catalogue of Berlind et al. (2006b), 
found that (i) blue groups are more strongly clustered 
than red groups of the same mass, and (ii) the color 
dependence is stronger for more massive groups. Since 
the galaxy samples and statistical methods used are very 
similar, these differences more likely result from differ- 
ences in the group-finding algorithms used or the groups 
selected. The general properties of the groups selected 
by Berlind et al. are similar to the groups used here. 
For instance, relationship between group richness and 
halo mass is about the same for both group catalogs. 
However, we do find that the central galaxy luminos- 
ity - halo mass relation in their group samples is quite 
different from that in ours, particularly for groups with 
masses < lO^^-^ h'^MQ. In their Fig. 3(d), Berlind et 
al. (2006a) showed that the central galaxies in low-mass 
halos have three or four distinctive populations (with un- 



filled gaps), which are not present in our samples. Thus 
the groups, especially the low-mass ones, are somewhat 
different in the two catalogs. On the other hand, as we 
will show in the next section using mock samples, the 
use of Ml as group mass induces systematic errors that 
can produce a significant part of the observed color de- 
pendence of the clustering bias factor. Since Berlind et 
al. have also used the group masses estimated using the 
total group luminosities, their results might also be af- 
fected by such systematic effect. Unfortunately, the bias 
due to the systematic effect is expected to be in the op- 
posite direction as color-dependence found by Berlind et 
al. (2006a), namely, it should enhance the clustering 
amplitude of red groups. While it is beyond the scope of 
our paper to identify in detail the origin of the discrep- 
ancy, we use mock samples in the next section to test the 
reliability of our results. 
Before proceeding to the next section, we follow 
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Fig. 10. — The distribution of the velocity dispersions for groups with red, blue and green centrals as indicated. The panels from left 
to right correspond to groups of different masses. The upper and lower panels show the results with A/^ and A/5 as the group mass, 
respectively. The velocity dispersions are estimated for groups with at least two members. 



Berlind et al. (2006a) to check whether the velocity dis- 
persions of the groups in halos of the same assigned mass 
but with centrals of different colors are the same. This 
provides yet another test about the reliability of our mass 
assignment. Here we measure the velocity dispersions for 
groups with at least two members using the gapper es- 
timator described by Beers, Flynn & Gebhardt (1990; 
see also Yang et al. 2005a). Although the estimate of 
the velocity dispersion is unreliable for estimating indi- 
vidual halo mass, the stacking result may be a reliable 
indicator of the mean mass of a set of groups. To show 
that the color-dependent bias is not a result of system- 
atic mass difference between the color subsamples, we 
estimate the mean velocity dispersion of each color sub- 
sample. The results are shown in Fig. [10] for groups of 
different assigned masses, with upper and lower panels 
corresponding to masses based on and Ms, respec- 
tively. The results show clearly that there is no system- 
atic difference between the mean velocity dispersions of 
groups with red, blue and green centrals. Thus, the color 
dependence we found in the SDSS observation is unlikely 
due to systematics in the mass assignments. 

5. RESULTS FROM THE MOCK SAMPLES 

In this section, we use the MGRS constructed from the 
Millennium semi-analytical galaxy catalogue (Croton et 
al. 2006) to investigate the mass and color dependence of 
the GGCCF. The purpose here is twofold. First of all, as 
mentioned above, we want to test the reliability of our re- 
sults against uncertainties in the identification of galaxy 
groups, and in the mass assignments. Secondly, we also 
want to investigate whether the semi-analytical model of 
Croton et al. which is fairly successful in matching ob- 
servational data, actually predicts a color dependence of 
the halo bias and how it compares to the data presented 
here. 

5.1. Dependence on Group Mass 



The GGCCFs for the MGRS have been measured us- 
ing exactly the same method used in Section 14.11 for the 
real data. Fig. [Tl] shows the GGCCFs for mock groups 
in different mass bins. Comparing these results with the 
observational results shown in Fig[2]), one sees that the 
mass dependence predicted by the model is very simi- 
lar to that obtained from the SDSS. Fig. [12] shows the 
corresponding relative bias as a function of group mass. 
As in the observational data, the relative biases are de- 
fined with respect to groups with masses in the range 
1012 _ 1012-5 /i-IMq. The mass dependence of the rel- 
ative bias can again be modeled with quadratic forms, 
as indicated in the panels. As with the SDSS data, the 
results are well matched by the predictions based on the 
clustering of dark matter halos in a ACDM cosmology, 
suggesting that our group finder is quite reliable in group- 
ing galaxies according to their common dark matter ha- 
los. In addition, the halo masses assigned to the groups 
also have to be fairly reliable, otherwise we would not 
have been able to recover the strong mass dependence 
shown. This is also consistent with the tests described 
in Yang et al. (2007), where it is shown that the masses 
assigned to the groups agree with the true halo masses 
with a scatter ;< 0.3 dex. 

5.2. The Dependence on Group Color 

Fig[T2] shows the relative biases of red, green and 
blue groups in the MGRS as function of the assigned 
group mass. Here the colors refer to those of the cen- 
tral galaxies (i.e., the brightest group members), which 
have been determined using the color-magnitude rela- 
tion shown in the right-hand panel of Fig. [T] As for 
the real data, the relative bias is defined relative to 
the GGCCF of all groups in the mass bin. In the 
left-hand panels the relative biases have been estimated 
from the projected GGCCFs using the data over the 
range 4.19 /i^^Mpc < r^ < 19.17 /I'^Mpc. The right- 
hand panels shows the relative bias factors obtained from 
the redshift-space GGCCFs using the data in the range 
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Fig. 11. — Similar to Fig. |2] except that the results are for the MGRS constructed from the Millennium galaxy catalogue. 



4.64 h^^Mpc < s < 22.98 h-^Mpc. The results shown 
in the upper two panels use group masses based on Ml, 
while the lower two panels use group masses based on 
Mg. Compared with the observational results shown in 
Fig. [HI the MGRS reveals a significantly stronger color 
dependence, especially for groups in the lowest mass bin, 

1012.0 _ 1012.5 h-^MQ. 

In order to investigate to what extent this color depen- 
dence of the GGCCF is affected by selection effects, we 
compare the results obtained from the MGRS with those 
obtained directly from the Millennium galaxy catalogue. 
For this purpose, we measure the relative biases directly 
from the Millennium simulation and use the color of the 
central galaxy to separate halos of a given mass into 
red, green and blue subsamples. Here we use the auto- 
correlation function of the dark matter halos (instead of 
the GGCCF) in real space to estimate the relative bias 



of halos in different subsamples. The relative biases are 
defined relative to the correlation function of all groups 
in the mass bin in question, and are estimated using data 
in the separation range 8.98/i~"'^Mpc < r < 44.6ft.~^Mpc. 
The solid symbols in Fig. [13] show the relative bias fac- 
tors of red, green and blue halos thus obtained as func- 
tion of the true halo mass. Clearly, the SAM predicts a 
color dependence. Similar to our results, red halos are 
more strongly clustered than blue halos of the same mass, 
and the color dependence is stronger for less massive ha- 
los. A comparison with the corresponding open symbols, 
which show the results obtained from the mock group 
catalogue constructed from the MGRS, indicates that 
the color dependence in the original data is fairly well 
recovered by our analysis of the GGCCF. In particular, 
when using Ms as halo mass estimator, the true trends 
can be recovered fairly accurately. Quantitatively how- 
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Fig. 12. — Similar to Fig. |3] except that the results are for the MGRS constructed from the Millennium galaxy catalogue. 



ever, there is discrepancy between the original signal in 
the Millennium Simulation and that recovered from the 
mock groups, especially for blue groups when GGCCFs 
in redshift space are used, in the sense that the recovered 
color dependence is stronger than the dependence in the 
original data. The discrepancy is larger when is used. 
The amount of systematics is comparable to the differ- 
ence between red and blue groups in the observational 
sample [see panels (c) and (d) of FigE] . In order to check 
further how the color dependence of the group clustering 
in the SDSS data can be affected by the group finder used 
and the model of halo-mass assignments, we carry out an 
additional test. Here we first randomly re-assign to each 
central or satellite galaxy of a given stellar mass in the 
Millennium Simulation a "'^(g — r)-color according to the 
color-stellar mass relation of the central or satellite galax- 
ies in the SDSS observation. The absolute magnitude of 
each mock galaxy is then re-calculated according to the 
new assigned color. Note that such a color assignment 
according to the SDSS observation makes sure that the 
color distribution of mock galaxies is the same as that in 
the observation, thereby reducing any effects that may 
be produced by the difference in the color distribution. 
Thus, the re-constructed new mock galaxy catalogue has 
a similar color-stellar mass relation as the SDSS obser- 
vation. More importantly, there should not be any color- 
dependence of group clustering in this new mock cata- 
log. Using this catalogue, we generate the MGRS again 
and perform all the same analyses (i.e. finding groups, 
assigning halo masses according to luminosity or stellar 
mass, and calculating the group correlation function) to 
obtain the relative bias of corresponding galaxy groups. 
The results are shown in Fig. [14] as the open symbols 
for groups with red (triangles), green (circles) and blue 
(squares) central galaxies, defined with the color crite- 
ria that is exactly the same as in the observation. For 
comparison, we also show, as the solid symbols, the true 
relative bias based on the original halos in the MGRS. 
The use of halos in the MGRS ensures that we are com- 
paring the same set of halos (groups) in the same large- 
scale structures and the differences between the two are 
simply due to the systematic errors. Ideally, the rela- 



tive biases for the halos in the MGRS should contain no 
color dependence, and the deviation of the solid symbols 
from the null hypothesis, brei — 1, which is at roughly 
the l-CT level, reflects the 'cosmic variance' in the color 
assignment. As can be seen, there is some difference be- 
tween the open and solid symbols, which indicates the 
systematic errors introduced by the group-flnding pro- 
cedure and the halo-mass assignment. The systematic 
errors are < 0.1 for the cases using Ml (see the upper 
two panels of Fig. [Tl)) . and are < 0.05 for the cases 
using Ms (see the upper two panels of Fig. [T4)l . These 
are signiflcantly smaller than the difference shown in Fig. 
[51 Finally we have also made a test in which each cen- 
tral or satellite galaxy in the Millennium Simulation is 
randomly re-assigned a, ^'^(g — r) color according to the 
color- magnitude relation of the central or satellite galax- 
ies in the SDSS. The overall systematics here is slightly 
smaller. Based on all these tests with the mock catalog, 
we conclude that our results are robust against the sys- 
tematics produced by the group-finding procedure and 
the halo-mass assignment. 

6. SUMMARY AND DISCUSSION 

In this paper we have used a large galaxy group 
catalogue constructed from the SDSS Data Release 4 
(DR4) to study how the clustering of galaxy groups de- 
pends on their masses and, at fixed mass, on the color 
of their member galaxies. We use the projected and 
redshift-space two-point cross correlation functions be- 
tween groups and galaxies to measure the relative biases 
for groups of different masses and colors. Our main re- 
sults can be summarized as follows: 

1. The correlation amplitude of galaxy groups de- 
pends strongly on their masses, and is in good 
agreement with the mass dependence of the bias 
of dark matter halos in the ACDM concordance 
cosmology. 

2. For a given mass, the correlation amplitude of 
groups also depends on the characteristic color 
of its member galaxies. Red groups are more 
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Fig. 13. — Similar to Fig. |8] but for results obtained from the MGRS (open symbols with lines connected). Note that, for comparison, 
the solid symbols in this plot are results obtained from the original Millennium simulation galaxy and halo catalogues, and the real space 
auto correlation functions are used in measuring the average relative biases. For clarity, the latter set of data are slightly shifted to left 
hand side. 



strongly clustered than blue groups. This color de- 
pendence is more prominent in low-mass groups, 
and becomes insignificant in groups with masses 
>, 10^"' H^^Mq. These results are in good, qual- 
itative agreement with those obtained by Yang et 
al. (2005b) from an analysis of galaxy groups in the 
2dFGRS, but disagree with the results obtained by 
Berhnd et al. (2G06a). 

The observed color dependence in the data is quali- 
tatively reproduced by the semi-analytical model of 
Croton et al. (2006), but the predicted color depen- 
dence is much too strong for low-mass halos with 
M ~ IQi^/i-i Mq. 

The systematic errors that can be induced by the 
group finder or by the method of assigning masses 
to the groups are discussed. And we conclude that 
our finding of the color dependence of the relative 
bias in the SDSS observation is robust. 



Unlike the mass dependence of the group clustering 
which can be straightforwardly linked to the mass de- 
pendence of the halo bias, an interpretation of the color 
dependence at fixed mass is less straightforward. It is 
tempting to link it to the assembly bias that has re- 
cently been discovered in numerical simulations, and 
which shows that haloes of a given mass that assemble 
earlier are more strongly clustered. The assembly time of 
a dark matter halo, ^main, is typically defined as the look- 
back time at which its main progenitor reaches a mass 
that is half of the halo's present day mass. 

A straightforward, but naive, interpretation of our re- 
sults therefore is that halos that assemble earlier con- 
tain redder galaxies. This would not only explain the 
sign of the color-dependence at fixed halo mass detected 
here, but since the assembly bias becomes less signifi- 
cant for more massive halos (e.g., Gao et al. 2005; Jing 
et al. 2007), it also naturally explains why the color- 
dependence is weaker for more massive groups. One way 
to invoke a positive, causal correlation between the as- 
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Fig. 14. — Similar to Fig. 1131 but for results obtained from the MGRS with randomly re-assigned galaxy colors according to the 
color-stellar mass distributions of the central and satellite galaxies separately in the SDSS observations. See text for details. 



sembly histories of dark matter haloes and the star for- 
mation histories of the associated galaxies, is to assume 
that the epoch of the last major merger, which is strongly 
correlated with the assembly time (Li et al. 2007b), also 
signals the time at which star formation is terminated. 
This may come about if the feedback from merger in- 
duced AGN activity expels (or heats) the gas associated 
with the merger remnant, thus shutting off the star for- 
mation activity (e.g., Di Matteo, Springel & Hernquist 
2005; Hopkins et al. 2006, 2007). Such a merger induced 
truncation mechanism would ensure that halos that as- 
semble earlier contain redder galaxies. 

Alternatively, the observed color dependence may be 
related to the 'archeological' downsizing. It is well known 
that, in hierarchical structure formation, more massive 
halos assemble later (e.g., Lacey & Cole 1993; van den 
Bosch 2002; Wechsler et al. 2002). Therefore, if indeed 
haloes that assemble earlier contain redder galaxies, it 
would imply that more massive haloes (i.e., clusters) con- 
tain bluer galaxies than low mass haloes. This is in clear 
conflict with observations, which show that more massive 



galaxies, which typically reside in more massive halos, on 
average have formed their stars earlier. An attractive so- 
lution to this problem has been proposed by Neistein et 
al. (2006), who introduced an alternative 'halo formation 
time', the time at which the sum over the masses of all 
the virialized progenitors with masses above a given min- 
imum mass, reaches half the present day halo mass. As 
shown by Neistein et al. (2006), contrary to the assembly 
time, tmainj defined as the accretion time of the last main 
progenitor, iaii increases with increasing halo mass, so 
that it provides a natural explanation for the observed 
archeological downsizing that galaxies in more massive 
haloes have older stellar populations. If the star forma- 
tion history of a galaxy is correlated with this forma- 
tion time, it may also explain the color-bias at fixed halo 
mass. The reason is that, as demonstrated in Neistein 
et al. (2006), although the formation time, tail, and the 
assembly time, tmain, of halos are anti-correlated when 
considering the entire halo population, at fixed halo mass 
iaii and imain are positively correlated. Thus, at fixed 
halo mass, a halo that forms earlier will contain a redder 
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galaxy and will also assemble earlier. 
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